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I  EXECUTIVE  SUMMARY 


We  outline  a  formalism  for  modeling  structures  containing  birefringent  elements  and  directional  couplers. 
The  traditional  description  of  couplers  (as  single  polarization  devices  with  two  input  and  output  ports)  and 
fibers  (as  single  port  devices  with  two  polarizations)  is  generalized  into  a  four-dimensional  vector  space 
with  convenient  co-ordinate  transformations  between  the  traditional  views.  This  enables  system  modelers  to 
introduce  complex  elements  and  scenarios  under  a  single  formalism,  switching  between  the  two  traditional 
views  with  a  single  4x4  matrix  transformation.  We  generalize  an  earlier  formulation  to  include  extra 
degrees  of  freedom  in  rotation,  coupling,  and  birefringence  matrix  operators.  The  technique  is  applied  to  a 
microwave  photonics  setting  and  a  quantitative  description  is  developed  that  illustrates  how  normal  mode 
loss  in  the  interferometer’s  couplers  leads  to  a  polarization-independent  phase  offset  in  the  output  arms. 
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II  INTRODUCTION 

Optical  directional  couplers  enabled  advances  in  fiber  optic  sensor  technology [1],  including  interfer¬ 
ometric  demodulation  in  microwave  photonic  links[2].  The  development  of  integrated[3]  Mach-Zehnder 
interferometers  (MZI)  has  also  enabled  dramatic  advances  in  digital  optical  communications,  which  in¬ 
creasingly  use  phase  modulation  to  avoid  optical  impairments[4,  5],  Evolving  system  metrics  will  inevitably 
tighten  component  and  subsystem  requirements,  driving  more  research,  experimentation,  and  development 
to  meet  stricter  specifications,  including  detailed  system  dependence  on  optical  polarization.  However, 
modeling  polarization  dependence  can  be  complicated  by  the  fact  that  optical  couplers,  and  the  waveguides 
attached  to  them,  have  different  natural  descriptions. 

Consider  the  Mach-Zehnder  interferometer  (MZI)  illustrated  schematically  in  Fig.  1.  Waveguides  F\ 
and  F2  carry  optical  signals  to  coupler  C\.  The  natural  descriptions  for  each  of  the  fibers  are  2  x  1  Jones 
vectors  which  describe  polarization  evolution.  We  call  this  the  “waveguide  view:”  the  waveguides  are  con¬ 
sidered  independent,  but  their  polarizations  may  couple  during  propagation.  On  the  other  hand,  directional 
couplers  are  usually  treated  in  the  scalar  “coupler  view:”  the  waveguides  are  viewed  as  coupled,  but  only  a 
single  polarization  is  considered.  Since  the  waveguide  view  (coupled  polarizations,  but  independent  wave¬ 
guides)  and  the  coupler  view  (coupled  waveguides,  but  independent  polarizations)  are  fundamentally  differ¬ 
ent,  polarization  in  couplers  is  often  ignored  as  a  practical  matter. 


f; 


Fig.  1:  Basic  Mach-Zehnder  interferometer.  Fibers  and  couplers  have  different  natural  descriptions  of  the 
interactions  between  fields. 

Treatments  of  polarization  dependence  in  couplers  are  generally  based  in  the  component  technology: 
for  example,  a  rigorous  development  by  Chen  and  Burns[6]  treated  two  birefringent  waveguides  having 
arbitrary  orientation  and  twist,  with  coupling  between  similar  polarizations.  Such  solutions,  which  integrate 
a  4  x  4  differential  equation,  are  more  suitable  to  device  research  than  quick  system  modeling.  In  another 
approach,  more  amenable  to  system  modeling,  the  “fiber  view”  was  used  exclusively [7]:  the  waveguides  of 
Fig.  1  were  tracked  throughout  the  structure,  including  through  the  couplers.  The  couplers  were  modeled 
as  4  x  4  matrices  with  two  degrees  of  freedom  (DOF),  one  DOF  for  each  polarization.  The  4x4  matrices 
they  proposed  could  be  considered  as  a  generalization  of  a  Jones  matrix[8],  with  all  the  attendant  modeling 
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advantages. 

This  work  generalizes  that  recent  paper  in  several  ways.  We  cast  the  problem  in  a  4-dimensional  space, 
as  in  [7],  but  we  also  introduce  a  simple  co-ordinate  transformation  between  the  fiber  view  and  the  coupler 
view  that  permits  switching  back  and  forth  between  the  views  as  needed.  This  permits  symbolic  reduction 
of  the  4  x  4  matrices  to  2  x  2  blocks  and  allows  easier  extensions  in  both  the  fiber  and  coupler  views.  We 
show  later  that  this  permits  an  easy  extension  of  the  model  to  normal  mode  loss,  for  instance.  Second,  we 
generalize  the  expressions  for  the  birefringent  waveguide  sections  to  cover  two  degrees  of  freedom  (DOF) 
for  the  birefringence  eigenstates,  rather  than  one.  In  a  similar  vein,  we  generalize  the  coupler  expressions 
to  three  DOF  instead  of  one,  so  that  modeling  of  extremely  broadband  devices  can  be  implemented,  for 
example.  In  this  vein,  we  generalize  the  rotation  operator,  as  well.  We  also  describe  the  formalism  using  a 
geometrical  representation,  so  that  the  comparisons  and  differences  are  more  readily  viewed. 

Ill  DESCRIPTION  OF  FORMALISM 
Overview 

Our  basic  approach  is  illustrated  in  Fig.  2,  upper  figure  (a)  of  which  is  an  exploded  view  of  a  coupler 


Fig.  2:  State  space  approach,  (a)  Physical  view  of  input  fibers,  coupler,  and  output  fibers  (b)  Exploded 
view  showing  ‘x’  (dashed)  and  ‘y’  (solid)  polarizations  for  the  input  and  output  waveguides  (1,2)  and  for 
the  coupler,  which  is  viewed  conceptually  as  two  2x2  couplers,  one  for  each  polarization.  Transformation 
matrix  ‘S’  changes  fields  from  the  waveguide  view  to  the  coupler  view. 

with  pigtails,  while  lower  figure  (b)  is  a  detailed  view  of  the  waveguides,  couplers,  and  their  fields.  In  the 
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fibers  and  in  the  couplers,  the  x  and  y  polarizations  are  indicated  by  dashed  and  solid  lines,  respectively. 

The  physical  coupler  is  viewed  conceptually  as  two  couplers,  and  C^y\  shown  in  Fig.  2b:  there  is 
one  for  each  polarization,  and  each  acts  as  a  conventional  2x2  coupler.  Cross-coupling  between  these  two 
couplers  is  shown  as  arrows  which  change  their  polarization  characteristic  in  connecting  between  the  two 
planes.  In  matrix  form,  these  arc  represented  by  2  x  2  matrices  exy  and  eyx. 

The  coupler  formulation  is  different  than  the  waveguide  formulation.  In  the  waveguide  view,  a  two 
dimensional  state  vector  represents  the  two  components  of  that  waveguide’s  polarization,  so  that  the  wave¬ 
guides’  fields  are  given  by 


r  „(i)  i 

’  ’ 

<?«  = 

$x 

s(1) 

by 

= 

&x 

S{2) 

by 

The  joint  state  of  the  fields  leaving  both  waveguides  and  arriving  at  the  coupler,  z  =  zb  in  Fig.  2,  can  be 
represented  by  stacking  their  independent  2-dimensional  Jones  vectors  to  form  a  4-dimensional  vector  in 
the  waveguide  view  as 


?(!) 

s*(2) 


B 


s^Xzb) 

s^Xzb) 

Sx2\zB) 

.  s12){zb)  . 


(1) 


On  the  other  hand,  the  fields  in  the  coupler  view  arc  represented  by  two  2-dimensional  “port”  vectors, 
one  for  each  coupling  plane  in  Fig  2.  Thus,  for  example,  the  inputs  to  the  “x  polarization”  coupler  arc  given 
as  the  amplitudes  at  the  input,  zb ,  for  the  two  ports: 


P{x\zB) 


Pxl\zB ) 
Px2\zb) 


As  above,  this  notation  is  meant  to  signify  that  the  fields  arc  described  in  the  coupler  view,  with  the  super¬ 
script  referring  to  the  particular  polarization.  We  alert  the  reader  to  the  notation.1  Again,  as  in  Eqn  (1),  we 
construct  the  4-dimensional  vector  comprised  of  these  two  independent  2-dimensional  vectors  in  the  coupler 
view  as 


(x) 
(: V ) 


J  B 


Px1\zb) 

Px2\zb) 

pi1](ZB) 

-  Py2XzB )  . 


(2) 


But  the  port  amplitudes  at  interfaces  B  and  C  are  exactly  the  same  as  the  waveguide  amplitudes  there: 
both  descriptions  form  bases  that  span  the  4-dimensional  space  describing  the  fields.  Indeed,  it  must  be  true 
that 


„(!)  =  s(i).  „(2)  =  s(2).  (i)  =  (i) 

Fx  X  »  Jrx  ax  i  Jry  °y 


,(2) 


In  other  words,  the  amplitudes  are  the  same,  but  they  are  rearranged  when  we  change  the  basis  for  the  4- 
dimensional  space  (assumed  complete2)  from  the  waveguide  view  to  the  coupler  view.  This  rearrangement 
operation  is  a  co-ordinate  transformation  in  the  4-dimensional  space  and  is  symbolized  by  the  operator  S 
as  in  Fig  2.  After  traversing  the  coupler,  the  amplitudes  are  again  rearranged  from  the  coupler  view  to  the 


'in  the  coupler  view,  the  vector’s  superscript  on  the  left  is  the  polarization,  while  the  components'  superscripts  in  the  bracket  on 
the  right  are  the  port  numbers.  These  conventions  make  the  matrix  algebra  and  the  transformation,  described  later,  more  consistent. 
2  As  in  [7],  we  treat  losses  independently  with  distinct  polarization-dependent  elements. 
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waveguide  view  by  S”1,  whereupon  their  evolution  is  further  described  by  the  Jones  matrices  for  those 
waveguides. 

In  the  remainder  of  this  section,  we  will  look  in  more  detail  at  the  waveguide  view,  the  coupler  view, 
and  the  co-ordinate  transformation. 

Waveguide  View 

Referring  to  Fig.  2,  polarization  states  in  waveguides  1  and  2  evolve  from  point  A  to  B,  the  input  to  the 
coupler.  From  the  left,  in  the  waveguide  view,  propagation  from  A  to  B  in  each  fiber  is  described  as  a  Jones 
vector  evolving  under  the  influence  of  a  2  x  2  Jones  matrix.  Thus,  on  each  of  these  paths  the  Jones  vectors, 
s' W  and  s^2\  evolve  under  the  influence  of  their  individual  Jones  matrices  (i.e.  linear,  but  not  necessarily 
unitary)  for  their  respective  waveguides.  There  is  no  coupling  between  the  fibers,  so  the  off-diagonal  blocks 
arc  zero  and  we  can  view  the  entire  process  in  2-dimensional  blocks: 


(3) 


The  absence  of  coupling  between  the  fibers,  evident  in  the  off-diagonal  zero  block  matrices,  is  reflected  in 
two  independent  Jones  matrix  evolutions 

s^(zb)  =  J 

for  waveguides  *  =  1,2. 


Coupler  View 

The  formulation  for  a  standard  4-port  directional  coupler  analysis  has  the  same  mathematical  structure 
as  the  Jones  calculus,  but  a  different  interpretation.  The  2x1  vectors  in  the  coupler  view  now  represent 
the  coupled  scalar  fields  at  the  two  physical  input  ports  or  output  ports,  and  the  coupling  is  described  with  a 
2x2  coupling  matrix.  The  usual  practice  is  to  analyze  a  single  polarization,  so  that  the  two  amplitudes  for 
the  x  polarization  at  the  output  ports  (zc),  for  example,  arc  linearly  related  to  the  port  amplitudes  at  zB,  the 
input  ports  by  coupling  matrix  Cl'k> 


Px\zC) 
_  Px\zc) 


c[?  ci?  l  r  p?\ZB) 

cl?  cl?  \  [  pi\zB) 


c(x) 


Px\zB) 

p£\zB) 


with  a  similar  equation  for  the  y  polarization,  and  where  we  have  introduced,  as  above,  the  2  x  2  block 
coupling  matrix  form  with  bold  font.  If  we  admit  the  possibility  of  coupling  between  the  polarizations,  in 
2-D  block  form  the  coupler  is  described,  similarly  to  Eqn  (3),  as 


p(x) 

p{y) 


'  exy  1  r  p(*)  ' 

.  £yx  CM  \  I  p(v ) 


(4) 


where,  for  example,  the  2x2  matrix  exy  couples  light  from  the  y  polarization  at  input  zB  to  light  in  the  x 
polarization  at  output  zc- 

We  argue  that  the  £VJ  matrices  should  be  small  for  several  reasons.  First,  the  overlap  integral  [13]  should 
be  small.  In  the  case  of  plane  waves,  it  would  be  identically  zero  for  the  orthogonal  x  and  y  polarizations,  and 
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in  adjacent  waveguides  it  should  be  small  and  dominated  by  minor  field  distortions.  Second,  the  construction 
of  the  coupler  should  force  a  natural  x  and  y  basis  by  virtue  of  the  planar  physical  symmetry  of  the  device 
imposed  by  its  substrate.  Any  birefringence  is  likely  to  be  in  this  basis  set,  and  thus  the  phase  matching 
between  the  two  x  (or  y)  polarizations  should  be  more  efficient  than  the  phase  matching  between  the  x 
and  y  polarizations.  That  is,  each  polarization  should  couple  more  strongly  to  its  adjacent  guide  for  similar 
polarizations  than  for  orthogonal  polarizations.  This  approximation  was  made  explicitly  in  earlier  work[6]. 
Since  both  of  these  effects  (i.e.,  very  weak  coupling  and  inefficient  phase  matching)  suggest  minimal  power 
transfer,  their  combination  leads  us  to  assume  negligible  coupling  for  our  basic  model.  That  is,  although  the 
model  does  not  require  the  e  to  be  zero  matrices,  we  assume,  as  earlier  [7],  that  they  can  be  ignored  and  we 
illustrate  the  model  with  that  assumption. 

In  matrix  form,  then,  in  traversing  the  coupler  from  zb  to  zc,  the  fields  can  be  described  in  the  coupler 
view  as 


p 

i 

c[{  0 

0 

pi ^ 

pip4) 

p{2) 

°21 

cg>  o 

0 

p(y) 

c 

Py] 

0 

o 

O. 

My) 

L  12 

Py] 

1 

to 

1 _ 

c 

0 

o  r ^ 
°21 

My) 
°22  -1 

1 

K 

to 

1 _ 

■  C(*)  0 

0  Cte)  _ 

pM  \ 

As  in  the  case  for  the  fiber  view,  the  off-diagonal  zero  block  matrices  indicate  the  lack  of  polarization 
coupling  in  the  coupler,  and  thus  there  arc  two  independent  2x2  block  coupling  matrices  on  the  diagonal 
for  the  two  polarizations 


P{i)(zC )  =  C  ®p®(zB), 

for  polarizations  i  =  x,y. 


Co-ordinate  Transformation 

The  block  diagonal  forms  for  the  waveguide  and  coupler  regions  would  naively  suggest  that  the  interfer¬ 
ometer  of  Fig.  1  could  be  treated  as  a  product  of  block  diagonal  matrices,  which  would  lead  to  two  separable 
two  dimensional  vectors.  Flowever,  the  J  and  C  matrices  have  different  bases  as  discussed  above.  Thus,  a 
co-ordinate  transformation  is  needed  to  move  between  the  two  physical  descriptions.  In  light  of  the  above 
equation,  one  sees  that  the  co-ordinate  transformation  taking  the  waveguide  view  to  the  coupler  view  is 


10  0  0 
0  0  10 
0  10  0 
0  0  0  1 


ST  =  S"1  : 


(5) 


in  other  words,  a  simple  transposition  of  the  2nd  and  3rd  rows  (or  columns).  Thus 


s£\zb) 

1  0  0  0  ' 

C/D 

p(,x ) 

Px\zb) 

s4\zb) 

0  0  10 

41\zb) 

_  C 

r  i 

p(y) 

B 

P^izB) 

41](zb) 

0  10  0 

Sx2\zb) 

—  o 

.  * (2) . 

B 

^to 

2L 

.  sl2\zB)  . 

0  0  0  1 

CO 

where  the  fields  arc  stacked  in  the  coupler  view  on  the  left  side  of  the  equation,  and  stacked  in  the  waveguide 
view  on  the  right  side  of  the  equation. 
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This  formulation  shows  that  at  point  B,  the  waveguide/coupler  junction,  we  can  transform  our  viewpoint 
by  making  a  co-ordinate  transformation.  That  is,  we  go  from  the  waveguide  view,  in  which  our  4-D  space 
consists  of  two  Jones  vectors  s' W  and  to  the  coupler  view  ,  in  which  case  the  space  is  spanned  by  port 
vectors  for  the  two  polarizations,  namely  p(x>  and  p!y> .  Because  S-1  =  S,  we  could  also  transform  the 
other  way: 


p(.x) 

p(y) 


B 


s' (2) 


=  S 


-l 


J  B 


fa) 

fa) 


=  S 


J  B 


fa) 

fa) 


J  B 


Interferometric  structure  example 

The  formalism  enables  us  to  write  down  structures  such  as  those  in  Fig.  3  by  operating  either  with  Jones 


Hr1  '  i  1 


~(d 


^2) 
^  /-tt  rt 


a 


y 


Fig.  3:  Interferometric  structure.  Input  fields  in  two  waveguides  propagate  through  sections  a,  (5,  and  7,  and 
are  coupled  in  couplers  C\  and  C'2 .  At  the  transitions,  the  co-ordinates  are  transformed  between  waveguide 
and  coupler  views  by  matrix  S  (see  text). 


matrices  on  waveguide  sections,  or  with  coupling  matrices  on  couplers,  so  long  as  we  use  S  to  transform 
between  the  views.  Given  a  structure  such  as  shown  in  Fig.  3,  we  can  describe  the  field  evolution  by  stepping 
through  the  structure,  as  described  below. 

We  can  relate  the  input  and  output  Jones  vectors  in  Fig.  3  by  inspection  using  2x2  matrices  (with  2x2 
block  matrix  elements)  for  the  operators  and  a  2  x  1  vector  (with  2x1  vectors  for  elements)  for  the  field 
vectors: 


AT 

’  out 
?(2) 

’  out 


47)0 

o47) 


clx)o 


0  c 


(y) 


J^O 

ojf 


c1(3;)  0 


0  c 


(y) 


4“}  0 

o4a) 


?(i) 

J  in 

fa) 


(6) 
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where  we  group  the  S  C  S  matrices  in  parentheses  to  connect  to  the  earlier  work[7],  That  is,  those  operators 
transform  from  the  waveguide  view  to  the  coupler  view,  apply  the  coupler  operator  to  the  state  in  coupler 
view,  then  transform  back  to  the  waveguide  view:  this  is  a  similarity  transform  of  C  into  the  waveguide 
view  used  earlier [7]. 

In  words,  we  describe  this  equation  (from  right  to  left)  as 

•  transporting  the  waveguide’s  fields  across  the  a  waveguides; 

•  changing  to  the  coupler  view,  operating  with  the  first  coupler,  and  then  changing  back  to  the  waveguide 
view; 

•  transporting  the  waveguide’s  fields  across  the  f3  waveguides; 

•  changing  to  the  coupler  view,  operating  with  the  second  coupler,  and  then  changing  back  to  the  wave¬ 
guide  view; 

•  transporting  the  waveguide’s  fields  across  the  7  waveguides. 

One  can  analyze  more  complicated  structures  by  walking  through  them  in  a  similar  way,  using  the  2x2 
block  form.  We  note  a  distinct  advantage  to  this  structure:  by  chaining  the  operators  to  keep  them  in  2  x  2 
block  form,  we  may  extend  operators  easily  at  each  stage.  That  is,  for  instance,  if  we  wanted  to  change  the 
form  of  C'2  '  to  model  normal  mode  losses,  it  is  possible  to  do  so  directly  by  modifying  C2  s  matrix  in  the 
coupler  view,  where  it  is  a  2  x  2  matrix. 

IV  FORM  OF  OPERATORS 

In  this  section,  we  summarize  the  forms  that  the  2x2  operators  take  in  expressions  such  as  Eqn  (6),  and 
compare  our  expressions  to  those  of  Ref  [7]. 


Polarization  dependent  loss 

The  simplest  optical  instrument  is  a  polarizer,  as  in  Fig.  4,  illustratively  described  in  the  waveguide  view 
as 

P(0)  =  [  ^  „0  1  0  <  Px,  Py  <  1 

U  Py 

where  the  diagonal  terms  describe  the  field  transmissions  in  the  x  and  y  polarizations  [8]:  an  ideal  x  polarizer 
would  have  px  =  1,  py  =  0.  The  rotation  matrix 


R(a) 


cos  a  —  sin  a 
sin  a  cos  a 


(V) 


which  actively  rotates  a  2D  vector  through  angle  a,  can  be  used  to  describe  a  polarizer  oriented  at  an  angle. 
The  matrix  product 

P(a)  =  R(a)P(0)R_1(a)  = 

describes  a  polarizer  whose  x  axis  is  oriented  as  shown  in  Fig.  4:  the  components  of  the  field  are  first  rotated 
by  —a,  then  operated  on  by  P(0)  expressed  in  its  own  basis,  followed  by  a  rotation  of  +a  to  restore  the 
original  orientation.  This  form  agrees  with  Eqn  (5)  of  the  earlier  formulation [7],  where  we  have  corrected  a 
typographical  error,  and  their  first  matrix  of  Eqn  (6).  We  will  generalize  this  below. 


cos  a  —  sm  a 


Px  0 
0  py 


cos  a  sm  a 
—  sin  a  cos  a 
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Fig.  4:  Polarizer  in  waveguide  view.  Polarizer  is  oriented  at  angle  0  with  respect  to  the  x  —  y  fiber  bases. 
Its  operator  is  a  similarity  transformation  of  the  operator  in  the  polarizer’s  basis. 

Birefringent  waveguides 

Propagation  in  birefringent  waveguides  is  described  naturally  with  the  Jones  calculus[8]-[10],  and  its 
extension  to  include  overall  optical  phase  [11,  12],  reviewed  in  Appendix  A  (Section  VIII).  Lossless  propa¬ 
gation  is  modeled  with  the  general  unitary  matrix 

J(0,<£,A>)  =  =  e-i* 

or 

J(^,A))  =  e-J'2*/2 

This  equation  has  all  four  degrees  of  freedom  (DOF)  permitted  by  unitary  matrices:  one  DOF  for  the  overall 
phase  (f>,  shown  in  Eqn  (37)  in  Section  VIII;  one  DOF  for  the  phase  difference,  p  =  2 ftz,  that  the  operator 
causes  between  components  in  the  operator’s  eigenstates;  and  two  DOF  for  the  unit  vector  the  location 
of  which  represents  the  operator’s  eigenstate. 

A  geometrical  representation  of  these  four  DOF  is  shown  in  Fig.  5,  which  illustrates  J  operating  on  an 
arbitrary  state  vector  [11,  12].  Note  that,  compared  to  the  conventional  representation,  state  |sj)  is  repre¬ 
sented  by  the  3D  vector  si  as  well  as  a  “fiducial  paddle”  which  permits  an  overall  phase  to  be  represented. 
In  this  representation,  the  “slow”  eigenstate  (eigenvalue  +1)  of  J  is  represented  by  the  unit  vector  f3,  whose 
components  are  the  (/3i ,  /32,  @3)  in  the  matrix  representation  Eqn  (10).  Operator  J,  in  the  3D  representation, 
rotates  states  about  /5  in  the  positive  (right-hand  rule)  sense  by  angle  <fi  =  2 /3z,  changing  the  latitude  and 
longitude  of  the  state,  and  hence  the  relative  amplitudes  and  phases  of  the  optical  components.  In  addition 
to  changing  the  relative  phases  of  |s)’s  components,  represented  by  the  changing  position  of  s,  the  operator 
also  changes  the  overall  optical  phase,  which  is  represented  by  a  fiducial  paddle  attached  to  s.  Thus,  in 
rotating  through  angle  <j>,  the  orientation  of  the  initial  fiducial  paddle  at  st  is  changed  to  the  grey  shaded 
paddle  at  Sf.  The  overall  phase  change  is  represented  by  a  rotation  of  the  fiducial  paddle  by  an  additional 


cos  f  -  j  sin  |  (/?3)  -j  sin  §  (/3i  -  j/32) 
-j  sin  f  (Pi  +  jp2)  cos  |  -  j  sin  f  (-/J3) 


(10) 


cos-ctq  -  j  sin-(/30  •  a) 


(9) 
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4 
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Fig.  5:  Geometrical  representation  of  general  unitary  matrix.  Operator  J  in  Eqns  (9,10)  with  eigenbasis 
0  rotates  state  about  0  through  angle  (f>.  Simultaneously,  the  fiducial  paddle  representing  overall  phase,  is 
rotated  about  s  by  angle  2  (p. 


2(f)  radians  in  the  positive  sense.  The  four  DOF  for  this  unitary  transformation  are  thus  completely  identified 
in  this  representation. 

Conversely,  any  unitary  matrix,  since  it  must  have  the  form  of  Eqn  (10),  can  thus  be  represented  by  its 
equivalent  0,  cf>,  and  cf>,  by  projecting  that  matrix  onto  the  basis  set  (do,  ai,  a2, 03)  and  identifying  the  phases 
[9] -[12].  We  will  use  this  to  compare  our  approach  to  the  earlier  model  [7], 


Coupler 


Ideal  coupling  matrices  are  unitary  matrices  similar  to  the  J  operators  discussed  above  and  reviewed 
in  Appendix  A  (Section  VIII):  they  conserve  power  and  are  linear  in  the  E  fields.  Consequently  their 
C  matrices  can  be  described  with  the  same  4  parameters  we  used  to  describe  propagation  operators  for 
birefringent  fibers:  what  changes  is  that  the  basis  states  are  now  unit  fields  in  waveguides  associated  with 
ports  1  and  2,  rather  than  unit  fields  in  x  and  y  polarizations. 


C {(f>,  </>,%)  =  e 


cos  -a 0 


•  ■  <t>(~ 

7  Sill  —  [K 
J  2  V 


cr 


(11) 


or 

cos  |  -  jsin|(K3) 

— J  sin  |  -jK 2) 

Thus,  2 (f>  =  2 R,z  describes  the  overall  optical  length  of  the  coupler,  (f>  describes  phase  differences  between 
fields  in  the  waveguides,  and  k  defines  the  eigenstates  of  the  coupler  in  terms  of  the  waveguide  modes.  A 
slight  difference  in  the  respective  interpretations  of  the  two  situations  is  that  the  coupling  components  are 


C(</>,  (j),  k)  =  e  j2*/2 


-j  sin  |  («]  -  jn2) 
cosf  -  j  Sin|(-K3)  _ 


(12) 
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conventionally  viewed  as  having  a  different  origin.  Components  n\  and  n-2  arc  obtained  from  the  overlap 
integrals  between  the  fields  in  the  two  modes  [13,  14]:  the  field  from  one  guide  creates  a  polarization  in  the 
other  guide,  which  drives  the  wave  in  the  first  guide,  and  vice  versa.  The  third  component,  Ky,  is  associated 
with  the  de-tuning  of  the  input  field  from  the  ideal  phase-matched  condition. 


V  APPLICATION 

Generalizing  earlier  work 

Earlier  work[7]  introduced  a  formalism  using  4x4  matrix  operators  to  describe  polarization-dependent 
interferometers.  Their  approach  used  coupling,  birefringence,  and  polarization-dependent  loss  (PDL)  opera¬ 
tors:  C(9Xl  6y),  D((j)),  and  T(anm),  respectively.  Additionally,  a  rotation  matrix  R(aa,  cp, )  was  introduced 
that  transformed  those  operators  to  different  PDL  and  birefringence  axes  on  waveguides  a  and  b,  respec¬ 
tively.  The  operators  arc  cast  in  what  we  call  the  “waveguide  view”  exclusively.  In  this  subsection  we 
generalize  those  operators. 


Operator  for  coupler 

The  coupling  operator,  again  for  i  — >•  —j,  was  given  as  [7] 


c  (9x,9y) 


cos (0X)  0  -  j  s in(0x)  0 

0  cos(9y)  0  —  j  sm(Oy) 

—j  sin(0z)  0  cos (0X)  0 

0  —  j  sin {Qy)  0  cos (9y) 


(13) 


where  9X  and  9y  represent  the  different  splitting  ratios  for  the  couplers  on  the  two  polarization  planes  in 
Fig. 2.  Since  this  operator  is  in  the  waveguide  view,  we  can  find  the  corresponding  operator  in  the  coupler 
view,  Cc,  by  a  similarity  transform: 

C(8x,8y)  =S  Cc  S-1  =s>  Cc  =  S-1C(9x,9y)  S.  (14) 

By  virtue  of  S_1  =  S,  this  means  that 


Cc  =S  C (9x,8y)  S 


cos (9X)  -  j  sin(0x)  0  0 

-j  s in(0x)  cos (9X)  0  0 

0  0  cos ( 8 y )  —  j  sin(0y) 

0  0  —  j  8^(0^)  cos (9y) 


(15) 


This,  in  turn,  means  that  the  coupling  matrices  for  each  of  the  x  and  y  polarizations  have  the  very  specific 
form 


C  ik\8k) 


COS  {0k)  -jsm(9k) 

— ^  r  — 

CW{8X) 

)  0 

-j  sin (9k)  cos (9k) 

'  — 

0 

C^(dy)  _ 

(16) 


where  each  coupler  operator  is  seen  to  have  one  degree  of  freedom,  9n.  But  as  we’ve  seen  (both  in  Section 
IV  and  in  Appendix  A,  Section  VIII),  the  coupling  operator  has  3  DOF  (or  4  if  one  includes  overall  phase), 
so  the  2x2  coupling  matrices  in  [7]  can  be  generalized  by 


&k\9k)^&k\fk,fk,ftk) 


0  C^y\(j)y,  (j>y,  Ry) 


(17) 


in  the  coupler  view.  We  now  illustrate  why  one  may  want  the  more  general  form. 
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For  an  ideal  coupler  with  identical  waveguides  placed  symmetrically  about  a  central  plane  orthogonal 
to  the  substrate  plane,  general  symmetry  arguments  show  that  the  eigenstates  arc  even  and  odd  functions 
[15].  The  usual  approach  [13]  uses  the  unperturbed  waveguide  modes  |1)  and  |2)  as  the  basis  set  to  form  the 
coupler’s  eigenmodes 

|<M  =  k)  =  ^<|1)  +  |2))  I'W  =  l°>  =  (I1)  - 12))  as) 


where  we  have  explicitly  labeled  the  even  and  odd  superposition  states. 

A  general  power  splitter  was  modeled  by  introducing  an  angle  that  corresponds  to  differing  coupling 
lengths  [7]:  reducing  the  4  DOF  in  Eqn  (12)  to  one  =  n 2  =  ^3  =  0),  the  operator  J  in  Eqn  (12)  reduces 
to 


Cideal(9) 


cos  6 

—j  sin  9 

ideal  3  dB  1 

i 

-j 

—j  sin  9 

cos  9 

71 

.  —j 

i 

(19) 


where  we  have  used  the  e+jbjl  convention. 

We  can  use  the  geometric  representation  to  show  the  difference  between  two  couplers,  both  of  which 
split  power  equally,  but  have  radically  different  forms.  In  the  geometrical  representation,  |1),  all  light  in 
guide  1,  is  represented  by  C3;  |2),  all  light  in  guide  2,  is  represented  by  —  c.>;  and  |e),  the  operator’s  slow 
eigenstate,  is  represented  by  k  =  ei,  the  axis  about  which  the  input  states  evolve.  For  an  ideal  3  dB  coupler, 
an  input  into  port  1  would  evolve  into  equal  power  in  both  guides  at  the  output  and  hence  would  correspond 
to  a  rotation  of  tt/2  about  the  ei  axis,  shown  as  the  first  operation  in  Fig.  6a.  If  this  ideal  coupler  were 
followed  by  another  identical  coupler  to  make  a  zero  path  length  interferometer,  the  system  would  evolve  as 
the  right  side  of  Fig.  6a.  This  form  (or,  equivalently,  6  — >  —9)  is  the  only  form  possible  for  a  3  dB  coupler 
with  one  DOF. 

An  extreme  example  of  another  3  dB  coupler  is  given  in  Fig.  6b.  In  this  case,  the  coupler  is  strongly 
detuned:  K3  in  this  coupler  (A  in  [13])  has  the  same  magnitude  as  k\,  but  it  is  still  a  3  dB  coupler  in  the 
sense  that  it  would  evenly  split  the  power  introduced  at  an  input  port.  The  system  evolution  is  quite  different: 
the  output  is  identical  to  the  input3,  and  no  fixed  phase  shift  between  the  stages  of  the  system  in  Fig.  6a  can 
replicate  the  behavior  of  Fig.  6b.  Using  Eqn  (17)  permits  greater  modeling  flexibility. 


Operator  for  propagation  in  birefringent  media 


Birefringence  is  manifested  as  different  propagation  phases  for  different  polarizations,  and  in  the  normal 
x  —  y  basis,  this  is  represented  with  different  phase  terms  on  the  diagonal.  To  permit  eigenstates  other  than 
the  basis  set.  Ref  [7]  uses  the  same  rotation  matrix,  Eqn(7),  used  for  PDF  in  Eqn  (8),  for  each  of  the  2x2 
diagonal  blocks  as 


R(ct)  D(</>n)R-1(a)  : 


D(0n) 


e~J<t>x  0 
0  e-^y 


e~j * 


e-M/2  0 

0  e+I^/2 


(20) 


where  we  have  made  the  substitution  i  — >•  —  j  to  convert  to  the  eJUJi  convention  [10,  11,  12].  D  contains  two 
DOF:  common  phase  f>  =  {(f) x  +  4>y)/2,  and  differential  phase  0  =  <i>x  ~  4>y  Performing  the  multiplication 
in  Eqn  (20),  we  find  that 


sin  2a 

R(ct)D(</.)R“1(a)  =  e~j2^2 

cos  <f>  oo  —  j  sin  cj)((3  •  a) 

:  0— ► 

0 

' 

cos  2  a 

3  Technically,  all  states  would  have  picked  up  an  additional  n  radians  of  phase  shift,  as  can  be  seen  by  tracking  the  fiducial 
paddles. 
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Fig.  6:  Two  different  ‘  3  dB’  couplers  in  an  interferometer,  (a)  Ideal  coupler,  zero  detuning.  Each  coupler 
transit  is  represented  by  a  7r/2  rotation  about  the  even  state,  (ei.  (b)  Detuned  coupler.  Waveguide  1  has 
greater  index,  which  would  prevent  full  power  transfer.  Each  coupler  transit  is  represented  by  a  vr  rotation 
about  (ei  +  e^)/y/2 


showing  that  /3  has  only  one  degree  of  freedom,  not  two.  (We  might  also  have  noted  that,  instead  of  4  DOF 
as  in  Eqn  (17),  the  operator  in  Eqn  (21)  has  3  DOF:  two  for  the  birefringence  D (<f)n),  and  one  for  a ,  the 
elevation  of  (3  above  ei  in  the  ei  —  e3  plane.)  The  similarity  transformation  implied  in  Eqns  (20,21)  thus 
corresponds  to  eigenstates  {}  limited  to  linear  birefringence. 

In  contrast,  the  birefringence  operator  J  in  Eqn  (10)  has  4  DOF,  and  has  eigenstates  with  two  DOF.  In 
fact,  it  can  be  shown  that  J  is  a  similarity  transform  of  the  birefringence  operator  D  (<f)n),  but  with  eigenstates 
corresponding  to  the  normal  form  (see  Appendix  A,  Eqns  (9,10),  and  Fig.  5) 


cos  |  e  J'^/2 
sin  §  e+M/2 


(22) 


which  expresses  two  DOF  for  the  slow  birefringence  eigenstate. 

In  terms  of  modeling,  the  two  formulations  would  thus  lead  to  different  statistical  distributions  in  bire- 
fringent  links  and  their  interference.  In  principle,  the  same  argument  can  be  made  about  the  representation  of 
the  rotated  partial  polarizer:  rotation  matrix  Eqn  (7)  allows  PDF  for  linear  eigenstates  only,  while  J  permits 
more  general  eigenstates. 
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Summary 

We  have  shown  that  generalizing  the  rotation  operator  opens  more  degrees  of  freedom  to  the  coupler, 
birefringence,  and  PDL  operators  in  Ref  [[7]].  But  the  basic  transformation  introduced  with  Eqn  (5),  is  also 
useful  in  extending  analyses.  For  instance,  the  transformation  shows  the  structure  of  the  differential  equation 
in  Ref  [6]  treats  only  coupling  between  similar  polarizations.  We  have  made  the  same  assumptions  here, 
but  the  transformation  is  useful  in  comparing  different  formulations.  By  using  a  transformation  between  the 
waveguide  and  coupler  views,  we  can  treat  all  components  as  simpler  block  diagonal  matrices,  at  the  cost  of 
introducing  a  similarity  transform.  In  the  next  subsection,  we  show  that  there  arc  advantages  to  this  tradeoff: 
we  apply  this  advantage  to  an  analysis  of  demodulators  in  microwave  photonic  links. 

Interferometric  demodulators  in  microwave  photonic  links 

Overview 

In  this  section,  we  apply  the  formalism  to  address  the  modeling  of  interferometers  with  birefringent 
elements,  schematically  shown  in  Figs.  1  and  3.  The  starting  point  is  the  operator  in  Eqn  (6),  which  is  a 
4x4  matrix  operator  taking  the  input  state  to  the  output  state.  Generally,  one  is  interested  in  the  detected 
power,  so  the  inner  product  of  the  final  state  is  taken  with  itself.  This  operation  can  be  performed  either  by 
constructing  an  overall  matrix  by  pre-multiplication  by  the  matrix’s  adjoint,  or  by  taking  the  inner  product 
of  the  output  directly.  Application  packages  such  as  Mathematica  make  keeping  track  of  the  fields  more 
manageable. 


General  development 

To  clarify  the  development  of  the  output  fields  associated  with  Eqn  (6),  we  show  an  exploded  diagram 
such  as  Fig.  7  to  help  identify  the  matrix  elements  in  the  analysis.  Fig.  7a  shows  each  of  the  diagonal 
subsystems,  the  2x2  matrices,  as  a  shaded  rectangle,  while  Fig.  7b  identities  the  functions  of  the  individual 
matrix  elements  for  both  the  couplers  and  the  waveguides. 

As  an  illustrative  example,  assume  that  the  input  is  on  port  1  and  has  x  polarization,  i.e.  E\x  =  1  and 
all  other  input  fields  are  zero.  We  analyze  the  system  to  find  the  powers  at  exit  ports  1  and  2,  the  powers  that 
will  be  collected  by  fibers  1  and  2,  respectively.  Defining  the  i  —  j  coupling  coefficient  in  Fig.  7  for  coupler 
n  and  polarization  k  as  (C n^)ij  =  cnkij ,  the  fields  at  the  output  are  given  by 


C2SJSCiS 


C2a:ll^jl')CLrll  +  C2xl2Jyi  C\x21 
C2yll«72i')Cia;ll  +  C\x21 

C2x2lJxi  Clxll  +  C2a;22«7ii^Ci*21 
C2y21</2i')Clrrll  +  C2y22«72i^Cla;21 


(23) 


Thus,  for  example,  the  y  component  on  fiber  2  is  the  superposition  of  light  following  two  paths:  (i)  across  the 

(x) 

top  arm  of  coupler  C  ,  ,  coupled  into  the  y  polarization  through  waveguide  Ji,  and  then  coupled  diagonally 

(v) 

across  C.2  to  port  2;  and  (ii)  coupled  diagonally  across  C  j  to  waveguide  2,  coupled  into  the  y  polarization 

(v) 

through  waveguide  J2,  and  across  the  bottom  arm  of  coupler  C2  .  This  is  represented  as  the  bottom  element 
on  the  right  of  Eqn  (23). 

These  outputs,  as  in  the  earlier  development^],  are  adequate  for  CW  inputs,  but  need  to  be  modified 
slightly  for  modulated  inputs.  Specifically,  the  effects  of  the  interferometric  waveguides  can  not  be  repre¬ 
sented  by  their  phase  delay  alone,  since  the  field’s  modulation  does  not  appeal-  instantly  at  the  output.  The 
standard  scalar  field  Fourier  analysis,  for  modulation  frequencies  much  less  than  the  carrier  frequency,  shows 
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Fig.  7:  Interferometer  analysis,  (a)  Schematic  as  in  Fig.  2  showing  coupler  polarization  planes  and  bire- 
fringent  fibers.  Matrix  elements  are  represented  by  lines  ( x  are  dotted,  y  are  solid)  between  elements,  (b) 
Definition  of  matrix  elements  for  coupler  plane,  C,  and  waveguides,  J. 

that  the  modulation  envelope  travels  at  the  group  velocity,  while  the  carrier  travels  at  the  phase  velocity  [14]. 
To  treat  our  birefringent  waveguides,  we  assume  that  polarization  mode  dispersion  (PMD)  is  unimportant. 
By  this  we  mean  that  the  modulation  envelopes  travel  at  the  group  velocity  through  the  fiber,  but  that  the 
differential  group  delay  (DGD)  between  the  principal  states  of  polarization  is  insignificant  for  the  modula¬ 
tion  imposed[10].  In  this  regime,  the  modulation  envelopes  suffer  a  common  group  delay  t  while  the  states 
of  polarization  of  all  the  spectral  components  essentially  experience  the  same  evolution.  We  can  then  treat 
the  fields  as  in  Eqn  (23)  with  the  understanding  that  the  modulation  envelope  must  be  tracked.  For  instance, 

J^E{t)ejut  — >  J^Eit  -  T)ejut  (24) 

where  we  have  subsumed  the  phase  delay  into  the  definition  of  J,  and  the  E  (t)  includes  both  amplitude  and 
phase  modulation.  Then  expansions  such  as  Eqn  (23)  must  be  tracked  with  the  fields  on  which  they  operate. 

Modeling  normal  mode  loss 

In  this  section,  we  demonstrate  the  utility  of  the  vector  space  formalism  by  modeling  couplers  with 
normal  mode  loss.  We  exploit  vector  space  techniques  to  calculate  the  coefficients  in  a  2  x  2  coupler.  Given 
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these  matrix  elements,  one  can  then  put  them  into  the  appropriate  blocks  in  Eqn  (6).  The  basic  idea  is  to  use 
another  transformation  from  the  usual  coupler  basis  to  the  normal  mode  basis  for  the  coupler  to  model  the 
loss,  then  transform  back  to  the  original  basis. 

We  first  note  that  a  symmetric  polarization- independent  coupler  can  be  viewed  in  terms  of  its  even  and 
odd  normal  modes[15]  and  the  matrices  which  transform  co-ordinates  between  the  port  co-ordinates  and  the 
normal  mode  co-ordinates. 


v^L  -i 


V2 


1  1 
1  -1 


The  length  of  the  coupler  determines  its  coupling  ratio,  and  this  is  determined  by  the  beating  between  the  two 
modes  as  they  propagate  through  the  length  of  the  coupler.  For  an  ideal  coupler,  disregarding  the  common 
phase,  this  would  give  rise  to  a  coupling  matrix  of 


i  r  i  i  i  r  e"-^/2  o  IT11 

“  2  L  1  —  1  J  L  0  e+j9c/2  \  [  1  -1 


cos 

-j  sin  ^ 


•  •  6c 

—j  sin  ~y 

Oq 

cos 


(26) 


namely  the  ideal  generalized  coupler  of  Ref  [7].  In  terms  of  the  geometrical  representation,  the  coupler  is 
represented  by  a  rotation  about  the  ei  through  an  angle  of  9  c'.  for  an  ideal  3 dB  coupler,  9c  =  vr/2,  and  an 
input  to  port  1  (|  1)),  would  evolve  to  state  |— ),  for  instance. 

If  both  the  even  and  odd  modes  experience  the  same  loss,  the  output  is  merely  scaled,  so  to  model 
differential  mode  loss,  we  assume  that  the  E  field  transmission  in  the  even  mode  is  unity  while  that  in  the 
odd  mode  is  p  <  1.  Assuming  an  otherwise  ideal  coupler,  the  coupling  matrix  would  then  be 

i  r  i  i  l  r  e-w4  o  iriii  =  e-^/4  r  i+jP  i-jP  i 
!  2  [  1  -1  J  [  0  Pe+jn/4  J  [  1  -1  J  2  [  1-jp  l+jp  \  '  ^ 


For  p  =  I  and  input  on  port  1 ,  one  sees  that  the  amplitudes  at  the  output  ports  are  equal  in  amplitude  and  7r/2 
out  of  phase.  If  there  is  normal  mode  loss,  however,  the  amplitudes  arc  still  equal,  but  the  phase  difference 
decreases  in  magnitude.  Note  that  this  is  an  example  for  which  merely  specifying  a  power  splitting  ratio 
would  not  capture  the  coupling  matrix,  and  it  illustrates  the  usefulness  of  working  in  the  block  diagonal 
form. 


Output  for  general  modulation  and  normal  mode  loss 

To  give  an  example  with  simplified  algebra,  we  assume  that  the  two  couplers  have  identical  normal  mode 
losses  as  in  Eqn  (27),  and  therefore  that 

e-W4  e-W4 

(C'n  )ll  ^nfell  Cnfc22  ^  (1  T  JP )  Cnkl2  C-nk21  ^  (1  JP )•  (28) 

These  arc  the  coefficients  that  arc  substituted  into  Eqn  (23),  in  evaluating  inner  product  E  '  E,  and  we  assume 
here  that  they  have  a  common  delay  which  can  be  ignored.  The  matrix  elements  for  the  waveguides,  how¬ 
ever,  must  be  attached  to  the  fields  they  transport,  as  mentioned  above.  Specifically,  the  fields  in  our  fiber  1 
arc  assumed  to  suffer  a  time  delay  of  r  with  respect  to  the  fields  in  fiber  2. 

Tracking  the  fields  permits  an  identification  by  matrix  element  index  set,  and  when  the  inner  product  is 
taken,  those  indices  indicate  the  field  on  which  the  element  operates.  Expanding  Eqn  (23)  and  treating  the 
terms  as  operators,  we  can  find  operators  for  the  powers  in  the  two  fibers  as 

A  =  (1^2)2  [i  +  2Re  (b  e~'j4tarr'  p^ 


(29a) 
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P2 


(1  +P2)2 
16 


A  +  2R  eB 


(29b) 


where 

A  =|  j[{]  |2  +  |  |2  +  |  |2  +  |  4l]  |2  and  (30a) 

B  =  J(ii]Jn  +41^4?  •  (30b) 

The  carets  in  the  equations  above  signify  that  the  scalar  matrix  elements  arc  to  be  linked  to  their  respective 
time-delayed  field  operands.  Once  they  arc  so  linked,  they  can  be  viewed  as  scalars.  After  performing  this 
identification  of  the  matrix  elements  with  their  time  delayed  envelopes,  we  arrive  at  an  expression  for  the 
time  dependence  of  the  powers  exiting  the  coupler  ports: 


Pi  =  —  [E*(t-T)E{t-T)  +  E{t)*E(t)  +  2Re  [-B  e+jAe*E*{t  -  r)E(t)]] 

P-2  =  (1 [E*t  -  t )E(t  -  t)  +  E*(t)E(t)  +  2Re  [B  E*(t  -  r)E(t)]\  , 
lb 

where  the  (nominally  small)  angle  ep  is  expanded  in  p  as 


7 r 

4 


ep  =  —  —  tan  1  p  =  tan 


-i  1  ~  P 

1  +p 


,2  ' 


(31a) 

(31b) 

(32) 


For  an  ideal  coupler,  ep  =  0,  and  Eqns  (31)  show  that  the  second  terms  in  the  expressions  for  P\  and /A 
arc  complementary,  regardless  of  the  relative  birefringence  in  the  interferomter’s  arms.  For  couplers  with 
normal  mode  loss,  ep  A  0,  and  Eqns(3 1)  show  that  there  is  dc  phase  shift  added  to  the  ac  signal  created  by  the 
interfering  fields.  We  illustrate  this  for  the  case  of  a  phase  modulated  input  applied  to  such  an  interferometer 
to  demodulate  it. 


Output  for  phase  modulation  and  normal  mode  loss 

Eqns  (31)  show  that  the  output  powers  vary  from  intensity  effects  (the  A  operator  which  gives  rise  to  the 
E*E  terms)  and  from  interference  terms.  Here,  we  investigate  the  effect  of  coupler  normal  mode  loss  on 
the  demodulated  output  in  a  phase  modulated  link,  for  which  the  amplitude  is  constant.  That  is,  assuming 
an  input  field  with  a  phase  modulated  optical  carrier  of  form 

E(t)  =  E0(t)e>(“ot+,t,mSinnt\  (33) 

what  effect  will  there  be  on  the  demodulator  output  ports  when  the  links  arc  birefringent  and  the  couplers 
suffer  unequal  normal  mode  losses?  The  ideal  case  for  a  scalar  field  was  investigated  by  Youngquist  et 
n/.[15],  but  we  generalize  the  demodulator  to  include  polarization  dependence  in  the  links  and  couplers. 

For  phase  modulation  with  an  rf  tone  at  <2,  as  in  Eqn  (33),  the  interference  term  is  given  by 

E*(t—  r)E(t)  —\  E0  |2  ef<Msinf2t-sinn(i-T))  _|  eq  |2  gi2</>m  sin  cos(f2t-  . 

Assuming  path  matching  (Hr  =  it),  lossless  fibers  {A  =  2),  and  quadrature  biasing  (B  ~  j),  the  powers  in 
the  two  arms  arc 

(l  D2 \ 2 

Pi  =  - - - -  |  Eq  |2  [1  +  sin(4ep  +  2</>msinHt)]  (34a) 

O 

(l  r>2\2 

P2  =  - - — —  |  E0  |2  [1  -  sin(2(/>m  sin  Qt)]  , 


(34b) 
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which,  in  a  balanced  detection  scheme,  would  result  in  a  detected  power  of 

(1  +  ^ 

Pdet  =  P\  ~  P-2  =  - - - [(1  +  cos  4ep)  sin(2< f>m  sin  fit)  +  sin  4ep  cos(2 <f>m  sin  fit)]  .  (35) 

8 

The  net  result  of  this  is  that  the  output  would  appeal-  to  be  offset  from  quadrature  by  approximately  2ep. 
This  effect  would  be  small,  but  noticeable:  for  a  coupler  with  a  0.25  dB  differential  loss,  for  instance, 

Iti  10 

2ep  «  1  -  p  ss  — —dBioss  =  0.058  rad  , 

where  ‘ dBioss ’  is  the  loss  one  would  measure  between  a  single  input  and  the  sum  of  the  two  outputs  of  the 
coupler. 

Summary 

In  this  section  we  have  shown  how  the  formalism  can  include  interferometric  structures  with  modulated 
inputs,  extending  the  usual  scalar  approach  which  treats  only  single  tones.  We  presented  a  sample  calculation 
showing  how  the  matrix  elements  can  be  derived  by  using  linear  vector  space  techniques  to  exploit  known 
behavior  in  the  normal  modes  of  the  system  elements,  specifically  treating  normal  mode  loss  for  couplers  in 
a  phase  modulated  microwave  photonic  link. 

VI  SUMMARY 

We  have  presented  a  formalism  which  uses  standard  linear  vector  space  techniques  to  analyze  interfero¬ 
metric  structures  with  birefringent  and  lossy  elements.  By  introducing  a  simple  co-ordinate  transformation 
to  move  between  descriptions  of  the  fiber  and  the  coupler  elements,  we  showed  that  the  4x4  matrices 
for  all  of  the  elements  could  be  reduced  to  2  x  2  diagonal  blocks.  This  simplification  permits  each  of  the 
system  components  to  be  treated  in  the  most  general  way:  couplers  and  waveguides  can  be  treated  with  the 
customary  two  dimensional  unitary  matrix  techniques,  and  polarization-dependent  losses  can  be  introduced 
with  arbitrary  pass  eigenstates.  Because  the  reduction  to  the  two-dimensional  space  is  so  fundamental,  and 
because  its  formalism  covers  both  couplers  and  birefringent  waveguides,  we  included  an  Appendix  which 
derives  and  summarizes  the  main  results  of  the  theory.  The  Appendix  also  provides  a  brief  summary  of 
the  geometric  representation,  which  is  useful  both  in  visualizing  problems  and  solutions,  and  in  casting  the 
problems  into  a  uniform  framework.  We  illustrated  the  formalism  by  analyzing  the  power  distribution  at 
the  output  ports  for  a  demodulator  in  a  phase  modulated  microwave  link.  We  found  that  normal  mode  loss 
creates  a  dc  bias  phase  to  the  interferometric  output  and  we  showed  how  to  estimate  that  phase. 
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APPENDIX  A 


VIII  APPENDIX- A:  ELEMENTS  OF  COUPLED  MODE  THEORY  IN  TWO-  AND  THREE-  DI¬ 
MENSIONS 

Representation  of  2D  complex  vectors  in  a  real  3D  space  for  telecom  applications  has  been  developed  in 
the  literature  [9,  10,  11,  12],  and  the  main  features  needed  for  the  development  of  this  paper  are  summarized 
here  for  convenience. 


2D  equation  of  motion 

Many  physical  systems  can  be  modeled  as  a  differential  equation  with  a  two-dimensional  matrix  evolu¬ 
tion  operator  as 
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(36) 


where  I  is  the  2x2  identity  matrix  and  where  the  average  of  the  diagonal  elements,  k,  has  been  identified 
explicitly.  (The  variables  are  barred  because  we  will  transform  them  later  into  more  convenient  forms.) 
In  particular,  this  equation  describes  both  propagation  in  birefringent  media  and  coupling  in  2  x  2  optical 
directional  couplers.  For  birefringent  propagation,  the  a,  arc  the  field  components  in  two  orthogonal  po¬ 
larizations,  the  Rj  represent  the  birefringence,  and  k  signifies  the  overall  optical  path  length.  For  2x2 
directional  couplers,  the  a*  are  field  amplitudes  in  the  two  waveguides,  the  Rj  represent  the  coupling  con¬ 
stants  between  the  waveguides  and  the  detuning  from  phase  matching,  and  k  represents  the  overall  optical 
phase.  Eqn  (36)  is  “conservative”  in  the  sense  that,  for  real  k  and  Rt,  the  norm  of  the  2D  vector  (assumed  to 
be  +1)  is  preserved.  This  reduces  the  vectors  to  3  DOF:  the  components’  relative  magnitude,  relative  phase, 
and  their  common  overall  phase. 


Suppressing  overall  phase 

In  some  situations,  the  overall  phase  is  unimportant  and  can  be  ignored.  In  such  cases,  the  common 
phase  constant  k  (which  may  be  a  function  of  distance)  can  be  suppressed  by  transforming  [9]  to  “slowly 
varying”  components  as 

ai{z)  ->  e-ii~k^dz  ai(z)  =  e-rtai (z),  (37) 

yielding  an  equation  of  motion  for  those  slowly  varying  components 


at 

K3  Kl-jK  2 

at 

a2  _ 

=  ~j 

R\  +  jR'2  -«3 

a2 

(38) 


This  equation  suppresses  the  overall  phase,  makes  the  operator  in  brackets  a  traceless  Hermitian  matrix,  and 
can  be  solved  for  the  slowly  varying  a,.  If  desired  at  the  end,  the  overall  phase  can  be  re-introduced  by  using 
Eqn  (37)  to  regain  the  d,. 

Eqn  (38)  permits  R  to  vary  with  z,  but  in  what  follows,  we  consider  the  case  for  constant  R  to  simplify 
the  development. 
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Pauli  matrices 

In  both  the  two-  and  three-dimensional  representations  of  2D  state  vectors,  the  Pauli  matrices  play  a 
pivotal  role,  since  they  connect  the  2D  and  3D  representations,  and  they  form  (with  the  identity  matrix)  a 
basis  for  2x2  matrices.  They  arc  given  by 


'  1 
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1  ' 

0  -j  ' 

'  1  0 

CT0  = 
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;  o- 1  = 
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0  -1 

Any  matrix  can  be  expressed  in  terms  of  this  basis  as 

M  =  M0a0  +  Mi<ji  +  M2 <72  +  M3CJ3  =  Mtat  =  M0a0  +  M  ■  a,  (40) 

where  the  abstract  “Pauli  vector”  is  given  by 

<7  =  (Jiei  +  a2e2  +  e3£73  =  ,  (41) 


and  the  expansion  coefficients  M*  in  Eqn  (40)are  given  by 

Mi  =  l-Tr  (<Ji  M)  =  l-Tr  ( Ma t)  .  (42) 

For  Hermitian  M ,  the  Mi  are  real,  and  thus  for  traceless  Hermitian  operators,  M  can  be  described  by  a  real 
3D  vector,  M.  Therefore,  Eqn  (38)  can  be  cast  as 
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(43) 


It  is  useful  to  express  k  in  terms  of  its  magnitude  and  unit  vector: 


k  =  kk  =  K(fciei  +  n2e2  +  ^363)  =  n  k =  «(sin  Oq  cos  foei  +  sin  9q  sin  4>  0^2  +  cos  ^0^3)  ,  (44) 


where  the  last  expression  puts  k  into  polar  co-ordinates  referenced  to  the  et  basis.  This  permits  the  equation 
of  motion,  Eqns  (38,  43)  to  be  written  in  either  Cartesian  or  polar  co-ordinates  for  the  operator  described  by 


k: 


where  D  describes  unit  vector  k 
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+  JK2 
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sin  #0  e+^° 
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(46) 


General  2D  solution 


For  fixed  k,  the  equation  of  motion  for  the  state,  Eqn  (45)  suggests  an  eigenvalue  problem.  Indeed,  one 
can  show  that  the  eigenvectors  of  D  can  be  expressed  in  terms  of  the  polar  co-ordinate  angles  as 


IV’s)  ~ 


cos  e  ^o/2 
sin  e+^o/2 


Hf)  ~ 


sin  f  e-^°/2 

-  cos  f  e+jfo /2  J  ’ 


(47) 


where  the  labels  signify  “slow”  and  “fast”  eigenstates  (in  accordance  with  convention  for  optical  propagation 
in  birefringent  media)  and  the  eigenvalues  +1  and  —1,  respectively.  Dirac  notation  signifies  that  we  view 
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the  states  as  abstract  vectors,  whereas  the  brackets  indicate  the  states’  representations  in  the  chosen  basis  for 
our  complex  2D  space. 

For  k  constant,  eigenstates  (47)  can  be  used  to  diagonalize  Eqn  (45).  That  is,  a  2  x  2  complex  matrix 
S  is  formed  from  the  two  column  vectors  in  Eqn  (47),  and  its  adjoint,  S',  is  transposed  complex  conjugate. 
After  transforming  to  the  basis  formed  by  the  eigenstates,  the  equation  of  motion  Eqn  (45)  can  be  integrated 
directly  to  find  the  unitary  transformation  matrix  relating  the  state’s  components  at  position  “ z ”  to  those  at 
the  input  as 
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(48) 

The  matrix  operator  above  is  unitary:  its  inverse  is  its  adjoint.  It  expresses  the  three  DOF  for  the  operator 
described  by  if  in  terms  of  three  angles:  9q  and  4>o  describe  the  direction  of  the  unit  vector  k,  and  Rz  is 
related  to  both  the  magnitude  of  k  and  the  distance  z  over  which  the  system  evolves. 

Recall  that  this  is  the  solution  for  constant  coefficients.  If  k  changes  with  z,  the  solution’s  form  is  more 
complicated.  However,  one  can,  at  least  in  principle,  imagine  stepping  the  solution  over  segments  A z  small 
enough  that  the  matrix  can  be  considered  constant,  and  constructing  a  solution  out  of  products  of  the  solution 
in  Eqn  (48).  Euler’s  theorem  [9], [11]  insures  that  the  result  is  equivalent  to  a  single  overall  rotation. 

Eqn  (48)  can  be  expressed  more  compactly  as 


(H  {z) 

2  KZ 

2  KZ 

= 

COS  — -  (To 

-  J  sin  — -  (k  ■  a) 

(12  (Z)  _ 

2 

2 

oi  (0) 
«2  (0) 


_  g -j(nz)k-a 


ai  (0) 

<12  (0)  J  ’ 


(49) 


which  clearly  shows  the  effect  on  the  eigenstates:  if  the  initial  vector’s  components  «,;(())  are  the  eigenvec¬ 
tors’  components  in  Eqn  (47),  then  k  ■  a  leaves  the  vector  unchanged,  returns  ±1  to  the  exponent,  and  the 
state  vector  is  multiplied  by  a  phase  factor  e^'JKZ .  In  particular,  for  future  reference,  we  note  the  special 
operator  given  by  k  =  Re 3:  in  this  case  9q  =  0  and  Eqn  (48)  reduces  to 
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(50) 


Finally,  we  make  note  of  the  middle  expression  of  Eqn  (49),  which  contains  “superfluous”  factors  of  2  : 
these  enable  a  direct  connection  to  the  geometrical  representation,  to  which  we  now  turn. 


Geometric  representation 

The  above  has  shown  that  both  the  matrix  operator  and  its  (+1)  eigenstate  can  be  characterized  by  unit 
3D  vectors.  In  this  section,  we  review  the  geometric  representations  of  states  and  operators.  The  description 
of  |  As)  in  Eqn  (47)  is  a  particularly  useful  construction,  and  we  refer  to  it  as  the  “normal  form:” 


\^)  ~ 


cos  I  e~j^2  ' 
sin  f  e+^/2 


(51) 


In  the  form  of  Eqn  (51),  the  angles  9  and  <f>  correspond  to  the  polar  and  azimuthal  angles,  respectively,  of  the 
unit  vector  representing  that  state.  For  0  <  9  <  it,  the  relative  amplitudes  run  from  unity  in  state  1 1)  to  unity 
in  state  |2),  and  (P  is  the  phase  difference  between  the  two  co-ordinates.  It  is  clear  that  in  Eqn  (47),  state  A.,) 
is  already  in  normal  form  while  A/)  can  be  brought  into  normal  form  with  the  change  of  variables 


9f  =  it  -  9S  ;  <f>f  =  <j>s  -  sgn(< f>s). 
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Fig.  8:  Representation  of  states.  States  in  normal  form  (Eqn  (5 1))  are  represented  in  3D  by  plotting  compo¬ 
nents’  relative  amplitudes  via  polar  angle  6  and  relative  phases  via  azimuthal  angle  <p.  Angles  are  doubled 
in  this  representation,  and  orthogonal  states  are  on  opposite  sides  of  the  sphere. 


This  shows  that  orthogonal  vectors  are  opposite  each  other,  rather  than  at  right  angles,  and  is  an  illustration  of 
the  fact  that  in  the  geometric  representation,  the  angles  are  twice  those  of  the  angles  in  the  2D  representation. 
An  illustration  of  how  states  map  is  shown  in  Fig.  8. 

The  Cartesian  co-ordinates  of  the  same  state  | -0)  are  given  by 


Si  =  (s|  CTj  |s)  =4 

so  that,  for  instance,  the  component  along  rfi  is 


s  =  (s |  o-iei  | s)  =  (s|  <t|s)  , 


si  =  (s|  o\  |s)  = 
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(52) 


=  sin  6  cos  4>,  (53) 


as  expected  from  the  polar  co-ordinates  in  Fig.  8.  This  is  analogous  to  Eqn  (44).  By  virtue  of  Eqn  (51),  and 
its  identification  of  angles,  either  polar  or  Cartesian  co-ordinates  establish  mappings  such  as 
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(54) 

where  we’ve  expressed  the  two  basis  states,  |1)  and|2),  their  even  superposition,  |e),  and  their  phase- 
advanced  superposition,  |  +) .  While  useful,  states  in  the  normal  form  do  not  show  the  overall  phase  necessary 
to  describe  interferometric  structures. 

A  state’s  evolution  under  an  operator,  such  as  in  Eqn  (48)  has  an  intuitively  appealing  description  in 
the  geometric  representation:  it  is  a  precession  of  the  state  vector  about  the  vector  corresponding  to  the 
slow  eigenstate,  k,  at  a  rate  of  2  R,  shown  in  Fig.  9.  In  this  figure,  an  initial  state,  which  corresponds  to 
the  3D  vector  si,  evolves  by  precessing  about  the  vector  it  at  a  rate  2k,  tracing  out  a  circle  on  the  unit 
sphere  (described  in  more  detail  below).  We  note  that:  (i)  both  the  operator  and  the  state  are  represented 
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Fig.  9:  State  evolution.  In  3D  representation  space,  the  state  that  corresponds  to  .st  evolves  to  the  state 
corresponding  to  st  under  the  operator  corresponding  to  k  by  rotating  about  k  through  an  angle  2d)  =  2 kz, 
for  constant  H. 


by  vectors  in  the  same  3D  space;  (ii)  the  state  vector’s  magnitude  doesn’t  change,  since  it  is  being  rotated; 
(iii)  the  state’s  rotation  rate  (and  thus  the  angular  displacement)  in  the  3D  representation  is  twice  its  rate  in 
the  2D  representation;  (iv)  the  arc  length  of  an  evolution  through  angle  2 Rz  depends  on  the  angular  distance 
between  if  and  sf  it  is  a  line  of  latitude  when  k  is  considered  a  pole;  and  (v)  in  particular,  for  the  two  state 
vectors  parallel  to,  or  anti-parallel  to,  k,  the  rotation  does  not  change  the  vector’s  position.  These  two  states 
correspond  to  the  slow  (eigenvalue  +1)  eigenstate  (along  +k)  and  the  fast  (eigenvalue  —1)  eigenstate  (along 
-«)■ 

It  can  be  shown[ll]  that  the  form  of  the  operator  in  Eqn  (48)  is  a  natural  outgrowth  of  the  earlier 
concepts.  That  is,  consideration  of  Eqns  (50,  51)  shows  that  Cop  operating  on  \ip)  increases  <fi/2  by  Rz 
while  keeping  9  constant.  Thus,  operator  Cop  corresponds  to  a  precession  of  the  vector  s,  representing  | ip) 
about  £3.  But  Eqn  (48)  is  just  a  similarity  transform  of  Co,o  by  the  S  formed  from  the  eigenstates  | ifs) 
and  |V’/)!  which  correspond  to  ±k.  Thus,  Eqn  (48)  is  the  operator  which  performs  (in  basis  ±k)  the  same 
operation  that  Cop  performs  (in  basis  ±63),  namely  a  precession  about  <23  at  rate  2 Rz.  This  same  result  can 
be  derived  by  direct  computation  of  the  transformed  differential  equation[9]. 

The  summary,  to  here,  is  the  conventional  geometrical  description  of  the  coupled  mode  equations,  and 
has  ignored  the  common  phase  which  was  suppressed  in  going  to  the  slowly  varying  amplitudes.  In  the  next 
section,  we  re-introduce  the  common  phase. 


Generalization  to  vectors  with  overall  phase 

Recently  we  introduced  a  formalism  which  permits  tracking  of  the  overall  phase,  the  third  DOF  for  a 
complex  vector  with  unit  norm.  In  addition  to  the  position  of  the  state  in  3D,  its  overall  phase  is  represented 
by  a  “fiducial  paddle”  as  shown  in  Fig.  10.  The  cardinal  states  are  shown,  illustrating  that  orthogonal  states 
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Fig.  10:  Representation  of  phase  by  fiducial  paddles.  Arbitrary  state  \ip)  is  represented  by  the  3D  vector  s 
and  associated  fiducial  paddle.  The  relative  amplitudes  and  phases  with  respect  to  the  2D  bases  are  shown 
by  s’ s  polar  and  azimuthal  angles,  6  and  cj),  respectively.  The  normal  form  (with  zero  overall  phase)  is 
shown  by  the  shaded  fiducial  paddle.  Overall  phase  cj)  is  represented  by  rotating  the  fiducial  paddle  through 
positive  (right-hand  rule)  angle  2 (f>. 


are  on  opposite  sides  of  the  unit  sphere.  4  An  arbitrary  state  has  form 


1^)  =  e 


j<p 

cos0/2e  •7^/2 

=  g— j2</>/2 

cos0/2e 

sin  9 /2e+^/2 

sm6,/2e+J<^//2 

(55) 


where  the  right  hand  expression  is  written  to  show  the  angles  as  represented  in  the  3D  space.  Such  a  state  is 
shown  in  the  figure  to  illustrate  fiducial  paddles,  which  comprise  a  colored  and  a  blank  blade  to  show  rotation 
angles.  In  the  figure,  the  grey  paddle  (aligned  to  the  line  of  longitude  for  cj))  corresponds  to  a  general  state  in 
normal  form,  Eqn  (51).  The  darker  paddle  corresponds  to  the  normal  form  after  multiplication  by  an  overall 
advancing  phase  factor  e~^\  the  paddle  is  rotated  in  a  positive,  right-hand  rule,  sense  through  angle  2cp. 
One  envisions  that  the  phase  can  be  counted  through  multiple  rotations  of  the  fiducial  paddle.  As  pointed 
out  earlier  [11,  12],  the  projection  of  one  unit  state  onto  another  is  effected  by  moving  it  (and  its  fiducial 
paddle)  in  parallel  transport  across  a  geodesic  to  that  state:  the  magnitude  of  the  projection  is  the  cosine  of 
half  the  geodesic  angle,  and  the  phase  is  half  the  angle  between  the  fiducial  paddles.  In  view  of  Eqn  (51), 
the  arbitrary  state  Eqn  (55),  and  the  half-angle  prescription,  we  can  see  from  Fig.  10  that  the  projection  of 
general  state  \ifj)  is  given  by  (1|  i/j)  =  cos  | 


Generalization  to  operators  with  overall  phase 

Operators  which  preserve  the  norm  of  the  vector  operands  are  unitary,  so  that  U'U  =  I.  Since  this  is 
a  set  of  4  equations  in  the  8  unknowns  (4  complex  elements  of  U),  there  must  be  4  DOF  for  the  matrix 
elements  of  U.  We  have  seen  above  that  when  the  overall  phase  is  suppressed,  an  operator  can  be  expressed 

4We  represent  states  |2)  ,  |o),  the  odd  superposition,  and  |— ),  the  phase  retarded  superposition,  as  “hidden”  behind  the  sphere. 
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as  a  3D  vector  k  and  an  equivalent  matrix  operator  Cg^.  If  we  include  the  overall  phase,  Eqn  (37)  shows 
that  all  we  need  do  is  multiply  the  state  by  e~^,  where  6  is  the  integrated  phase  up  to  point  z.  Then,  in  light 
of  Eqn  (49),  unitary  operators  can  be  cast  in  the  form 


U  =  e 


_  _  e-j2<t>/2 


<t>  ■  ■ 

cos  -(To  -jsin-(x-  a) 


(56) 


It  is  clear  from  this  form  that  U  is  unitary.  In  explicit  matrix  form,  this  operator  is 

U {4>,  </>,  k)  =  e~j2&2 


cos  |  -  j  sin  |  (k3)  -j  sin  f  («i  -  jn2) 


2  J  2  J  2 

.  -j  sin  |(/ci  +  jn2)  cos  §  -  j  sin  f  (-k3)  _ 


(57) 


where  the  m  are  the  components  of  unit  vector  k.  The  effect  of  this  operator  is  shown  in  Fig.  11,  and  can 
be  connected  to  Fig.  10  in  the  following  way.  The  spectral  theorem  [16],  in  light  of  Eqn  (49),  implies  that, 
the  eigenstates  ±k  with  eigenvalues  ±1  will  pick  up  a  phase  difference  of  ±kz.  Thus,  in  a  system  in  which 
the  eigenstates  are  a  basis,  the  state  will  rotate  about  k  (eigenvalue  +1)  through  angle  cj>  =  2Rz,  picking  up 
relative  phase  cf)  in  this  basis  in  a  similar  way  as  in  Fig.  10.  But  this  is  happening  with  respect  to  the  slow 
eigenstate  |/c),  not  |1).  At  the  same  time,  the  state  also  acquires  an  overall  phase  cf>  from  the  operator,  which 
further  rotates  the  fiducial  paddle  by  2f>  about  the  s  axis.  As  a  result,  the  evolution  is  depicted  as  in  Fig.  11. 
Thus,  the  operator  rotates  the  initial  state,  vector  s)  with  its  fiducial  paddle,  through  angle  cf)  to  its  final  state, 


Fig.  11:  Geometrical  representation  of  unitary  operator.  Eigenstate  of  operator  with  eigenvalue  +1  is  k, 
with  co-ordinates  K.t  in  Eqn  (57).  Initial  state  is  represented  by  s)  and  a  fiducial  paddle  to  represent  phase. 
Operator  rotates  s)  through  angle  cf  about  k,  while  simultaneously  rotating  the  fiducial  paddle  about  s. 

while  simultaneously  rotating  the  fiducial  paddle  about  s’  s  axis  through  angle  2<fi. 
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Application  to  birefringent  propagation  and  optical  coupling 

As  mentioned  above,  both  propagation  in  birefringent  optical  media  and  evanescent  coupling  in  adjacent 
waveguides  arc  usually  represented  with  the  formalism  described  in  Eqn  (36).  We  briefly  review  these  two 
applications  here. 

In  optical  propagation,  </>  corresponds  to  the  overall  propagation  phase,  i.e.  the  average  “optical  path 
length”  of  the  medium.  The  evolution  operator  represented  by  the  magnitude  and  unit  vector  of  k,  the 
medium’s  birefringence,  which  is  commonly  referred  to  as  3.  If  |1)  is  illustratively  chosen  as  the  x  axis, 
c:>,  then  ei  is  linearly  polarized  light  at  45°  to  the  x  axis,  and  C2  is  right-circular  polarization.  If  the  “slow” 
eigenstate  were  at  45°,  and  y  polarized  light  were  introduced,  the  state  would  evolve  by  precessing  about  ei 
in  a  right-hand  sense,  from  —  e.3  (y  polarization)  to  e-2  (right  circular  polarization,  to  c.>  (x  polarization),  to 
— e2  (left  circular  polarization),  and  back  to  — e>. 

The  same  sort  of  evolution  occurs  for  the  ideal  3  dB  coupler.  Now,  however,  the  states  1 1)  and  \2)  corre¬ 
spond  to  waveguides.  For  symmetric  identical  waveguides,  the  “slow”  eigenstate  is  the  even  superposition 
of  individual  waveguide  modes.  Thus  k  lies  along  the  ei  axis,  as  above.  Light  introduced  into  waveguide  2 
would  be  represented  as  —£3,  and  the  evolution  would  proceed  as  described  in  the  paragraph  above.  In  this 
case,  however,  the  states  arc  interpreted  as  (i)  light  in  waveguide  2,  (ii)  light  evenly  divided,  but  waveguide 
l’s  phase  advanced  by  7t/2  with  respect  to  2’s,  (hi)  all  light  in  waveguide  1,  and  (iv)  light  evenly  divided  but 
waveguide  l’s  phase  retarded  by  7r/2  with  respect  to  2’s,  and  (v)  back  to  all  light  in  waveguide  2. 

Conclusion 

This  Appendix  has  reviewed  state  evolution  in  the  basic  2D  form  and  its  representation  in  3D.  We’ve 
shown  that  conservative  systems  arc  characterized  by  unitary  evolution  operators,  that  the  overall  phase  of 
an  operator  can  be  suppressed  and  re-introduced  at  will,  that  the  operator  eigenstates  arc  parallel  or  anti¬ 
parallel  to  the  representation  of  the  operator,  and  that  state  evolution  under  a  constant  operator  is  given 
by  a  precession  of  the  state’s  3D  vector  about  the  operator’s  vector.  The  overall  phase  was  shown  to  be 
represented  by  a  fiducial  paddle  which,  in  addition  to  being  rotated  rigidly  with  the  state,  is  also  rotated 
about  the  state’s  3D  vector  by  twice  the  average  phase  introduced  by  the  operator. 


